Serveur d'exploration sur l'OCR

Attention, ce site est en cours de développement !
Attention, site généré par des moyens informatiques à partir de corpus bruts.
Les informations ne sont donc pas validées.

An improved physically-based method for geometric restoration of distorted document images.

Identifieur interne : 000B23 ( Main/Exploration ); précédent : 000B22; suivant : 000B24

An improved physically-based method for geometric restoration of distorted document images.

Auteurs : Li Zhang [Singapour] ; Yu Zhang ; Chew Tan

Source :

RBID : pubmed:18276976

English descriptors

Abstract

In document digitization through camera-based systems, simple imaging setups often produce geometric distortions in the resultant 2D images because of the non-planar geometric shapes of certain documents such as thick bound books, rolled, folded or crumpled materials, etc. Previous works have demonstrated that arbitrary warped documents can be successfully restored by flattening a 3D scan of the document. These approaches use physically-based or relaxation-based techniques in their flattening process. While this has been demonstrated to be effective in rectifying the image content and improving OCR, these previous approaches have several limitations in terms of speed and stability. In this paper, we propose a distance-based penalty metric to replace the mass-spring model and introduce additional bending resistance and drag forces to improve the efficiency of the existing approaches. The use of Verlet integration and special plane collision handling schemes also help to achieve better stability without sacrificing efficiency. Experiments on various document images captured from books, brochures and historical documents with arbitrary warpings have demonstrated large improvements over the existing approaches in terms of stability and efficiency.

DOI: 10.1109/TPAMI.2007.70831
PubMed: 18276976


Affiliations:


Links toward previous steps (curation, corpus...)


Le document en format XML

<record>
<TEI>
<teiHeader>
<fileDesc>
<titleStmt>
<title xml:lang="en">An improved physically-based method for geometric restoration of distorted document images.</title>
<author>
<name sortKey="Zhang, Li" sort="Zhang, Li" uniqKey="Zhang L" first="Li" last="Zhang">Li Zhang</name>
<affiliation wicri:level="4">
<nlm:affiliation>School of Computing, National University of Singapore, 3 Science Drive #2, Singapore. zhangli@comp.nus.edu.sg</nlm:affiliation>
<country xml:lang="fr">Singapour</country>
<wicri:regionArea>School of Computing, National University of Singapore, 3 Science Drive #2</wicri:regionArea>
<orgName type="university">Université nationale de Singapour</orgName>
</affiliation>
</author>
<author>
<name sortKey="Zhang, Yu" sort="Zhang, Yu" uniqKey="Zhang Y" first="Yu" last="Zhang">Yu Zhang</name>
</author>
<author>
<name sortKey="Tan, Chew" sort="Tan, Chew" uniqKey="Tan C" first="Chew" last="Tan">Chew Tan</name>
</author>
</titleStmt>
<publicationStmt>
<idno type="wicri:source">PubMed</idno>
<date when="2008">2008</date>
<idno type="doi">10.1109/TPAMI.2007.70831</idno>
<idno type="RBID">pubmed:18276976</idno>
<idno type="pmid">18276976</idno>
<idno type="wicri:Area/PubMed/Corpus">000051</idno>
<idno type="wicri:Area/PubMed/Curation">000051</idno>
<idno type="wicri:Area/PubMed/Checkpoint">000051</idno>
<idno type="wicri:Area/Ncbi/Merge">000049</idno>
<idno type="wicri:Area/Ncbi/Curation">000049</idno>
<idno type="wicri:Area/Ncbi/Checkpoint">000049</idno>
<idno type="wicri:doubleKey">0162-8828:2008:Zhang L:an:improved:physically</idno>
<idno type="wicri:Area/Main/Merge">000B35</idno>
<idno type="wicri:Area/Main/Curation">000B23</idno>
<idno type="wicri:Area/Main/Exploration">000B23</idno>
</publicationStmt>
<sourceDesc>
<biblStruct>
<analytic>
<title xml:lang="en">An improved physically-based method for geometric restoration of distorted document images.</title>
<author>
<name sortKey="Zhang, Li" sort="Zhang, Li" uniqKey="Zhang L" first="Li" last="Zhang">Li Zhang</name>
<affiliation wicri:level="4">
<nlm:affiliation>School of Computing, National University of Singapore, 3 Science Drive #2, Singapore. zhangli@comp.nus.edu.sg</nlm:affiliation>
<country xml:lang="fr">Singapour</country>
<wicri:regionArea>School of Computing, National University of Singapore, 3 Science Drive #2</wicri:regionArea>
<orgName type="university">Université nationale de Singapour</orgName>
</affiliation>
</author>
<author>
<name sortKey="Zhang, Yu" sort="Zhang, Yu" uniqKey="Zhang Y" first="Yu" last="Zhang">Yu Zhang</name>
</author>
<author>
<name sortKey="Tan, Chew" sort="Tan, Chew" uniqKey="Tan C" first="Chew" last="Tan">Chew Tan</name>
</author>
</analytic>
<series>
<title level="j">IEEE transactions on pattern analysis and machine intelligence</title>
<idno type="ISSN">0162-8828</idno>
<imprint>
<date when="2008" type="published">2008</date>
</imprint>
</series>
</biblStruct>
</sourceDesc>
</fileDesc>
<profileDesc>
<textClass>
<keywords scheme="KwdEn" xml:lang="en">
<term>Algorithms</term>
<term>Artifacts</term>
<term>Artificial Intelligence</term>
<term>Automatic Data Processing (methods)</term>
<term>Documentation (methods)</term>
<term>Image Enhancement (methods)</term>
<term>Image Interpretation, Computer-Assisted (methods)</term>
<term>Imaging, Three-Dimensional (methods)</term>
<term>Pattern Recognition, Automated (methods)</term>
<term>Reproducibility of Results</term>
<term>Sensitivity and Specificity</term>
</keywords>
<keywords scheme="MESH" qualifier="methods" xml:lang="en">
<term>Automatic Data Processing</term>
<term>Documentation</term>
<term>Image Enhancement</term>
<term>Image Interpretation, Computer-Assisted</term>
<term>Imaging, Three-Dimensional</term>
<term>Pattern Recognition, Automated</term>
</keywords>
<keywords scheme="MESH" xml:lang="en">
<term>Algorithms</term>
<term>Artifacts</term>
<term>Artificial Intelligence</term>
<term>Reproducibility of Results</term>
<term>Sensitivity and Specificity</term>
</keywords>
</textClass>
</profileDesc>
</teiHeader>
<front>
<div type="abstract" xml:lang="en">In document digitization through camera-based systems, simple imaging setups often produce geometric distortions in the resultant 2D images because of the non-planar geometric shapes of certain documents such as thick bound books, rolled, folded or crumpled materials, etc. Previous works have demonstrated that arbitrary warped documents can be successfully restored by flattening a 3D scan of the document. These approaches use physically-based or relaxation-based techniques in their flattening process. While this has been demonstrated to be effective in rectifying the image content and improving OCR, these previous approaches have several limitations in terms of speed and stability. In this paper, we propose a distance-based penalty metric to replace the mass-spring model and introduce additional bending resistance and drag forces to improve the efficiency of the existing approaches. The use of Verlet integration and special plane collision handling schemes also help to achieve better stability without sacrificing efficiency. Experiments on various document images captured from books, brochures and historical documents with arbitrary warpings have demonstrated large improvements over the existing approaches in terms of stability and efficiency.</div>
</front>
</TEI>
<affiliations>
<list>
<country>
<li>Singapour</li>
</country>
<orgName>
<li>Université nationale de Singapour</li>
</orgName>
</list>
<tree>
<noCountry>
<name sortKey="Tan, Chew" sort="Tan, Chew" uniqKey="Tan C" first="Chew" last="Tan">Chew Tan</name>
<name sortKey="Zhang, Yu" sort="Zhang, Yu" uniqKey="Zhang Y" first="Yu" last="Zhang">Yu Zhang</name>
</noCountry>
<country name="Singapour">
<noRegion>
<name sortKey="Zhang, Li" sort="Zhang, Li" uniqKey="Zhang L" first="Li" last="Zhang">Li Zhang</name>
</noRegion>
</country>
</tree>
</affiliations>
</record>

Pour manipuler ce document sous Unix (Dilib)

EXPLOR_STEP=$WICRI_ROOT/Ticri/CIDE/explor/OcrV1/Data/Main/Exploration
HfdSelect -h $EXPLOR_STEP/biblio.hfd -nk 000B23 | SxmlIndent | more

Ou

HfdSelect -h $EXPLOR_AREA/Data/Main/Exploration/biblio.hfd -nk 000B23 | SxmlIndent | more

Pour mettre un lien sur cette page dans le réseau Wicri

{{Explor lien
   |wiki=    Ticri/CIDE
   |area=    OcrV1
   |flux=    Main
   |étape=   Exploration
   |type=    RBID
   |clé=     pubmed:18276976
   |texte=   An improved physically-based method for geometric restoration of distorted document images.
}}

Pour générer des pages wiki

HfdIndexSelect -h $EXPLOR_AREA/Data/Main/Exploration/RBID.i   -Sk "pubmed:18276976" \
       | HfdSelect -Kh $EXPLOR_AREA/Data/Main/Exploration/biblio.hfd   \
       | NlmPubMed2Wicri -a OcrV1 

Wicri

This area was generated with Dilib version V0.6.32.
Data generation: Sat Nov 11 16:53:45 2017. Site generation: Mon Mar 11 23:15:16 2024